Acoustic Modeling with Bootstrap and Restructuring Based on Full Covariance

نویسندگان

  • Xiaodong Cui
  • Xin Chen
  • Jian Xue
  • Peder A. Olsen
  • John R. Hershey
  • Bowen Zhou
چکیده

Bootstrap and restructuring (BSRS) has been shown in our previous work to be superior over the conventional acoustic modeling approach when dealing with low-resourced languages. This paper presents a full covariance based BSRS scheme, which is an extension of our previous work on diagonal covariance based BSRS acoustic modeling. Since full covariance provides richer structural information of acoustic model compared to its diagonal counterpart, it is advantageous for both model clustering and refinement. Therefore, in this work, full covariance is employed in BSRS to keep the structural information until the last step before being converted to diagonal covariance for practical applications. We show that using full covariance further improves the performance over diagonal covariance in the BSRS acoustic modeling framework under the same model size without increasing computational cost in decoding.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Towards High Performance LVCSR in Speech-to-Speech Translation System on Smart Phones

This paper presents the endeavors to improve the performance of large vocabulary continuous speech recognition (LVCSR) in speechto-speech translation system on smart phones. A variety of techniques towards high LVCSR performance are investigated to achieve high accuracy and low latency given constrained resources. This includes one-pass streaming mode decoding for minimum latency, acoustic mode...

متن کامل

Dimensional reduction, covariance modeling, and computational complexity in ASR systems

In this paper, we study acoustic modeling for speech recognition using mixtures of exponential models with linear and quadratic features tied across all context dependent states. These models are one version of the SPAM models introduced in [1]. They generalize diagonal covariance, MLLT, EMLLT, and full covariance models. Reduction of the dimension of the acoustic vectors using LDA/HDA projecti...

متن کامل

A High Order Approximation of the Two Dimensional Acoustic Wave Equation with Discontinuous Coefficients

This paper concerns with the modeling and construction of a fifth order method for two dimensional acoustic wave equation in heterogenous media. The method is based on a standard discretization of the problem on smooth regions and a nonstandard method for nonsmooth regions. The construction of the nonstandard method is based on the special treatment of the interface using suitable jump conditio...

متن کامل

Modeling with a subspace constraint on inverse covariance matrices

We consider a family of Gaussian mixture models for use in HMM based speech recognition system. These “SPAM” models have state independent choices of subspaces to which the precision (inverse covariance) matrices and means are restricted to belong. They provide a flexible tool for robust, compact, and fast acoustic modeling. The focus of this paper is on the case where the means are unconstrain...

متن کامل

Bootstrapping a Compiler for an Equation-Based Object-Oriented Language

What does it mean to bootstrap a compiler, and why do it? This paper reports on the first bootstrapping of a full-scale EOO (Equation-based Object-Oriented) modeling language such as Modelica. Bootstrapping means that the compiler of a language can compile itself. However, the usual application area for the Modelica is modeling and simulation of complex physical systems. Fortunately it turns ou...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2011